Scaling and efficiency determine the irreversible evolution of a market 
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Abstract: 



In setting up a stochastic description of the time evolution of a financial index, the challenge 
consists in devising a model compatible with all stylized facts emerging from the analysis of 
financial time series and providing a reliable basis for simulating such series. Based on constraints 
imposed by market efficiency and on an inhomogeneous-time generalization of standard simple 
scaling, we propose an analytical model which accounts simultaneously for empirical results like 
the linear decorrelation of successive returns, the power law dependence on time of the volatility 
autocorrelation function, and the multiscaling associated to this dependence. In addition, our 
approach gives a justification and a quantitative assessment of the irreversible character of the 
|/-\ index dynamics. This irreversibility enters as a key ingredient in a novel simulation strategy of 

^vj ■ index evolution which demonstrates the predictive potential of the model. 
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Introduction 

For over a century it has been recognized [lj that the 
unpredictable time evolution of a financial index is in- 
herently a stochastic process. However, in spite of many 
efforts 0, i, H H i, 0, S S 0, El, a unified frame- 
work for simultaneously understanding empirical facts 
QS El El EE EE El EI El , such as the non-Gaussian 
form and multiscaling in time of the distribution of re- 
turns, the linear decorrelation of successive returns, and 
volatility clustering, has been elusive. This situation 
occurs in many natural phenomena, when strong cor- 
relations determine various forms of anomalous scaling 
[H M, S3, SI SI SE SE S3 • Here, by employing novel 
mathematical tools at the basis of a generalization of 
the central limit theorem to strongly correlated variables 
[28|] , we propose a model of index evolution and a corre- 
sponding simulation strategy which account for all robust 
features revealed by the empirical analysis. 

Let S(i) be the value of a given asset at time t. The 
logarithmic return over the interval [t, t + T] is defined 
as r(t,T) = ]nS(t + T)-hxS(t), where t = 0,1,... 
and T — 1,2..., in some unit (e.g., day). From a suf- 
ficiently long historical series one can sample the em- 
pirical probability density function (PDF) of r over a 
time T, p T (r), and the joint PDF of two successive re- 
turns r\ = r(t,T) and r 2 = r(t + T,T), denoted by 
p 22 ;(ri, r 2 ). This joint PDF contains the information on 
the correlation between r\ and r 2 in the sampling. A 
well established property El El EE El] is that, if T is 
longer than tens of minutes, the linear correlation van- 
ishes: / p 2 Mri,r 2 )r 1 r 2 dridr2 = (rir 2 )_(2) = 0. This is 

a consequence of the efficiency of the market [3[ , which 
quickly suppresses any arbitrage opportunity. Another 
remarkable feature is that, within specific T-ranges, p T 
approximately assumes a simple scaling form 
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where g and D are the scaling function and exponent, 
respectively. Eq. {1} manifests self-similarity, a symme- 
try often met in natural phenomena [2CJ, SH S3) HE S3 : 
plots of T D p T vs r/T D for different T's collapse onto the 
same curve representing g. We verify the scaling ansatz 
in Eq. ((T|) for the Dow Jones Industrial (DJI) index using 
a dataset of more than one century (1900-2005) of daily 
closures. This index is paradigmatic of market behavior 
and the considerable number of data reduces sampling 
fluctuations substantially. In Fig. [T] the collapse of the 
empty symbols is rather satisfactory (the explanation of 
the meaning of the full symbols in Fig. 1 is given be- 
low). The sca ling function in Eq. fl} is non-Gaussian 
[I 51 El El EE EE E3- Although linear correlations 
vanish, in the T-range considered g is determined by the 
strong nonlinear correlations of the returns. Only for 
T > t c (with r c of the order of the year) successive index 
returns become independent and Pr turns Gaussian in 
force of the central limit theorem [29fl . 



Results and discussion 

Our first goal is to establish up to what extent the as- 
sumption of simple scaling in Eq. ([I]) does constrain the 

i 2 \ 

structure of the joint PDF p 2 rf. One must of course have 
Jp 2 2 ^(r 1 ,r 2 )S(r - n - r 2 )dr 1 dr 2 = P 2T {r), 



s ( 2 



fp 2 T{r 1 ,r 2 )dr 2 =p T {r 1 ), 



(2) 



Jp 2 T(n : r 2 )dri =p T (r 2 ). 



Indeed, the first line of Eqs. © follows from r(t, 2T) = 

/ 2 \ 

ri+r 2 . Furthermore, since the joint PDF p 2 rf is sampled 

from a sequence of time-translated intervals of duration 
2T along the historical series, both the first and the sec- 
ond halves of all such intervals provide an adequate sam- 
pling basis for p T . This justifies the second and third lines 
of Eqs. ([2]). At this point we notice that the property 

((ri + r 2 ) 2 } (2 ) = 

T 2D . Hence, we 

1/2. Remarkably, for 
is found to scale con- 
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all developed market indices, 

sistently with a D pretty close to 1/2 |30j. 

By switching to Fourier space in Eq. ^, the notion 

of a novel, generalized product operation allows to iden- 

i 2 \ 

tify a solution for p 2 rf in terms of p T alone. While the 

ordinary multiplication of characteristic functions (i.e., 

Fourier transforms) of Gaussian p T 's would yield triv- 

(2) 

ially the correct p 21 i in the case of independent succes- 
sive returns [291 ] . the generalized product is used here 
to take into account strong nonlinear correlations con- 
sistently with the anomalous scaling they determine (see 
Supporting Information) and can be seen to be at the 
basis of a novel central limit theorem [28| ■ Our solution 
is strongly supported by the remarkable consistency with 
the numerical results and by the analogy with the inde- 
pendent case. In Fig. [2] we compare the PDF of the 
return r 2 conditioned to a given absolute value of the 
return r%, as obtained through our solution (continuous 
lines), with the empirically sampled one (symbols). The 
agreement does not involve fitting parameters, since D 
and those entering the assumed analytical form of ~g are 
already fixed in Fig. [T] 

At this point we must take into account that the sim- 
ple scaling ansatz in Eq. |T]) is only approximately valid 



Indeed, a consequence of Eq. |T]) is (|? 



/Pt 



rpql 



which we exploited above for q = 2. However, a careful 
analysis reveals that the g-th moment exponent deviates 
from the linear behavior qD ~ q/2 for q > 3 (empty 
circles in Fig. [3]). Like the linear behavior with slope 
1/2 observed for low-order moments, this multiscaling 
effect is common to most indices [30]. To explain this 
feature, we have to investigate the relation between the 
empirical p T and the stochastic process generating the 

— —(2) 

time series. If PDF's like p T and p 2 ^ were directly de- 
scribing such a process, this would be with stationary 
increments. This assumption is legitimate only for suf- 
ficiently long times, larger than r c . Below, we identify 
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FIG. 1: Data collapse for p T (r) (T measured in days) sampled 
from a record of about 2.7 x 10 4 DJI daily closures (empty 
symbols). The average daily trend of the order of 10~ 4 has 
been subtracted. The collapse analysis furnishes the scaling 
function ~g reported as the full line and the scaling exponent 
D ~ 1/2 (see also Fig. [3] and Supporting Information). The 
full symbols report the data collapse for the results of a single 
simulation of the DJI history. 



in the interplay between scaling and non-stationarity a 
precise mechanism accounting for the robust features of 
p T detected for T <C T c , including its multiscaling. 

Let us indicate by pt : r and p t 2T the ensemble PDF's 

corresponding to p T and p 27 ;, respectively. The addi- 
tional dependence on t, the initial time of the interval 
[t,t + T], shows that we do not assume stationarity for 
these PDF's. We postulate that, within specific T-ranges 
(e.g., the one in Fig. [1]), p ,T obeys a simple scaling like 
that in Eq. ([T]), but possibly with a D and a g different 
from D and 7/, respectively. One then realizes that this 
scaling and the linear decorrelation of returns impose on 

p t It constraints analogous to those for p\t m Eq. ©, 
except for the third one, which now reads 



P0 2 2T( r l' r 2) dr l 



PT.T{r2) =P0,aT(7 2 ) 



(3) 



This last condition tells us that, as a consequence of 
the nonlinear correlations, the effective time span of the 
marginal PDF obtained by integrating p^ 2T m r i must 
be renormalized by a factor a. This factor is determined 
again by consistency of the second moments scaling prop- 



erties, as above. Since now (\r\ ) PoT ~ T , one gets 
from Eq. © a = (2 2D - l) l / 2D . So, D ^ 1/2 implies 
a 7^ 1 and thus non-stationarity and irreversibility of the 
process. Similar functional relations hold for the PDF's 



FIG. 2: Conditional probabilities of daily returns T2 for dif- 
ferent values of n.. Empty symbols refer to the DJI data. 
The continuous curves are the predictions of our theory for 

P2T( r 2|kl|) = [P2T( r l' r 2) + P2T(" r l'' r 2)]//[P2T('-1^2) + 

P~2t(~ r ii r 2)]dr2- The absolute value of ri is introduced for 
reducing sample fluctuations. 



of the magnetization of critical spin models upon dou- 
bling the system size and can be explained in that con- 
text by the renormalization group theory [27| . Our gen- 
eralized multiplication of characteristic functions allows 

(2) 

us to express p\ 2T in terms of ptj- and to establish the 
time-inhomogeneous scaling property 
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It remains now to make explicit the link between the 
p's and the sampled p's and to determine D. By con- 
struction, p T is a t-average of pt.T- Since the time inho- 
mogeneity of pt.T must cross over into homogeneity for t 
exceeding r c , we expect the following approximation 



1 
p T (r) = — V Pt.T(r) 



(5) 



t=o 



to hold. Indeed, the history over which p T is sampled 
is much longer than r c and allows in principle also an 
indirect sampling ofp tj T if we simply assume Pt+r a ,T{r) — 
Pt,r{r)- 

In spite of the fact that Eq. ([J} implies a simple scal- 
ing exponent D for po,T> Eq. ([5]) leads to the remarkable 
property that, independently of D, the low-g moments 
of p T approximately scale with exponent q/2 as soon as 
r c 3> 1. Moreover, if D < 1/2, p T displays a multiscaling 
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FIG. 3: Scaling exponent of the q-th moment of p T . Empty 
(full) circles refer to the DJI data (simulation) of Fig. [l] The 
dashed line is q/2. Multiscaling is due to the deviation of 
D(q) from a constant value. The full line reports the time- 
averaged asymptotic (t c 2> 1) theoretical prediction based on 
Eq. (|j. 



of the same type as that found empirically: D(q) < 1/2 
for the high-order moments. The matching of the theo- 
retical predictions for the multiscaling of p T on the basis 
of Eq. ((5]) with the empirical results is a first way of 
identifying D. For the DJI, in Fig. [3] we show that with 
D = 0.24 this matching is very satisfactory. The scal- 
ing functions of Po,t and p T can also be shown to be 
simply related, once D is known. We notice that the 
observed multiscaling features of financial indices, which 
inspired multiplicative cascade models [g, [lj| in analogy 
with turbulence [20 . |21| , are explained here in terms of 
an additive process possessing the time-inhomogeneous 
scaling ((4|). 

The introduction of autoregressive schemes like ARCH 
|5| marked an advance in econometrics and financial anal- 
ysis [f| Q , and, more generally, in the theory of stochastic 
processes. In an autoregressive simulation a number of 
parameters weighting the influence of the past history on 
the PDF of the following return must be fixed through 
some optimization procedure. By our approach, a gen- 

(2) 

eralization of p t 2T to the case of n-consecutive intervals 
can be fully expressed just in terms of pt,T and D. This 
is obtained by taking the inverse Fourier transform of 
our solution for the characteristic function of the joint 
PDF (see Supporting Information). In this way we can 
precisely calculate the PDF which rules the extraction 
of the i-th return, r^, giving as conditioning inputs the 
previous m ones, 7"i_ m , . . . , fj-i. Consistently with our 
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time separation r (in days), 
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where t max + T — 1 is the total length of the time series. 
Symbols are as in Fig. [3] The simulation here is precisely 
the same we refer to in Figs. [T]and[3] The full line gives the 
slope of the time-averaged asymptotic (r c S> 1, r < r c ) model 
prediction for the volatility autocorrelation, superimposed to 
the data (see Supporting Information). 



schematization in Eq. |5|) , the existence of exogenous fac- 
tors acting on the market can be taken into account by 
resetting the width of the marginal PDF's with an (av- 
erage) periodicity equal to r c (see Supporting Informa- 
tion). The results for a single simulation with m = 100, 
t c = 500, and D = 0.24 are illustrated by the full sym- 
bols in Figs. [T] and [3) The coincidence of the scaling 
properties observed for the DJI with those of our simu- 
lation furnish a second strong indication of the validity 
of our approach and of the estimation of D. 

The correctness of the value of D can be further 
checked by considering the volatility autocorrelation 
function at time-separation r (Fig. 3]). A well estab- 
lished fact [H 111 H [IS ES El is its power law decay 
c(t) ~ t-P for r < r c , with /3 ~ 0.2 for the DJI. This be- 
havior is not reproduced by routine simulation methods 
in quantitative finance like GARCH 6] and requires the 
introduction of more sophisticated, fractional integration 
techniques [9j . The full characterization of the joint PDF 
of n-consecutive returns allows us to obtain a model ex- 
pression for c(t), which again takes into account the non- 
stationarity of the process (see Supporting Information) . 
Such an expression behaves asymptotically as a power of 
t with an exponent depending on D (c(t) is constant for 
D = 1/2 and decays for D < 1/2). In particular, with 



D = 0.24 both the model asymptotic expression and the 
results of our simulation procedure furnish a nice agree- 
ment with the exponent j3 ~ 0.2 observed for the DJI 
index (Fig. 0|. Thus, the algebraic volatility autocorre- 
lation function decay is reproduced by our scheme and 
provides a second criterion to fix consistently the anoma- 
lous scaling exponent D. 

Our approach is based on two postulates: 
inhomogeneous-time scaling and the vanishing of 
linear return correlations. These symmetries lead, in 
an unambiguous, deductive manner, to a model for 
the underlying stochastic process determining market 
evolution. Of course, the results follow only when the 
postulates are valid and we have shown that within 
specific time-ranges the consequences of these postulates 
are in remarkable agreement with the data. Major 



advances in understanding critical phenomena worked 
in a similar vein decades ago [20J, when the scaling 
assumptions allowed to establish links between seemingly 
disparate phenomena and put the basis for the develop- 
ment of renormalization group theory [27] . So far, the 
coexistence of anomalous scaling with the requirement 
of absence of linear correlation imposed by economic 
principles has been regarded as an outstanding open 
problem in the theory of stochastic processes. We believe 
that our solution could be relevant for developments in 
this field, as well as for describin g sc aling behaviors of 
other complex systems [U El El El El M El] • 
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